CHAPTER 14


BACKING UP SYSTEM DATA


After reading this chapter and completing the exercises, 
you will be able to: 
+ Understand the issues surrounding backups and back-up strategies 
+ Discuss hardware and software issues related to backing up Linux 
data, such as back-up devices and storage media choices 
+ Use popular back-up utilities such as tar, cpio, and graphical back-up
utilities


In the previous chapter you learned about managing the printing services on
Linux. You learned how to create a printer configuration file and how to manage
print queues using command-line or graphical utilities. You also learned how to
print to remote print servers and to printers attached to computers running other
operating systems.


In this chapter you will learn about backing up data stored on a Linux system. You
will learn how to plan effective back-up strategies for different types of systems
and environments. You will also learn about the hardware components, back-up
media, and software utilities available to help you create and manage backups of
your data.


BACK-UP STRATEGIES 


As you learned in Chapter 9, no matter how many precautions you take, all computers are
subject to failure. Thus, making backups of essential files is a form of insurance. In its simplest
form, a backup is nothing but a copy of data on a computer system. However, backing up
thousands of files owned by dozens or hundreds of users is not a simple process. But because
the time and equipment needed to make backups are minimal compared with the costs
associated with lost data, nearly all organizations regularly back up the files on their computer
systems according to an established back-up plan. A back-up plan is a written document
that outlines when, how, and, perhaps, why various files and file systems will be backed up,
stored, and, when necessary, restored. As you might guess, implementing the back-up plan
normally falls to the system administrator.




Among other things, the back-up plan should specify the type of back-up media to be used.
The term back-up media refers to the device that stores the backed-up data, such as a tape
cartridge (the most common format), writeable CD, or even a floppy disk. The back-up plan
should also specify how lost data should be restored. The term restore refers to the process
of copying data from a back-up location (for example, a tape cartridge) onto the file system
where that data is normally used, and from which it was unintentionally lost.


Because of the complexities involved, developing a back-up strategy that works well in any
organization is an ongoing process. As a system administrator, you can expect to work with
numerous existing and new computer systems, a variety of applications and data storage needs,
and computer users whose preferences and actions are rarely predictable. The following sections
address some of the questions that you should consider when formulating a back-up plan.


Asking Initial Questions


Some of the initial questions that you'll want to consider as you formulate a back-up plan 
include the following: 


+ What files should be backed up? You might initially think that everything on the system
needs to be backed up. Although that's an admirable goal, time and cost restrictions
might make it impractical. You can evaluate various parts of your system to determine
what data is easily restored from CD, such as the operating system or an
application. If you are short on resources, these items can be re-created (and then
reconfigured) from their original sources rather than from a backup that you create.


+ Who will back up files? As mentioned previously, this normally falls to the system
administrator. You may, however, decide that users on a networked system have
some responsibility. Perhaps users should be informed that only data placed in a
certain directory area will be backed up each night. Or a system administrator may
share the responsibility for backups with a colleague, either to reduce the work
burden on one person or to make backups more accessible in case they are needed
for restoring data.


+ Where are files located? You probably know offhand where most of the different
types of data are located on your Linux system. A more thorough approach can
help you see which specific directories on the system are being actively used,
which contain data that is easily reconstructed, and which hold temporary files
that don't warrant the effort of a regular backup. These are just three examples of
the categories you might assign to parts of your system as you review the various
file systems and devices that store data.


+ How should backups be performed? The answer to this question may be determined
by the equipment you purchase, as well as by how your organization operates its
computer systems. Many system administrators must back up data during
non-work hours. This process can be automated in most cases using a cron job (see
Chapter 12). You might also want certain events to trigger a regular backup, or
a different type of backup than would normally occur. For example, you might
want to back up the entire system before installing new hardware devices such as
SCSI adapters.




+ Must you be able to restore data within a specific period of time? When a problem occurs
(and it will), several factors affect how rapidly you can restore lost data to the system.
These factors include the size and location of the lost files and the media format on
which the back-up data was stored. Your back-up plan should reflect the importance
of timing in your organization. In some organizations, the ability to restore lost data
immediately is essential. In others, speed may not be quite as critical.


A well-designed back-up plan will make it easy and convenient for you to regularly back up
system data and restore files. Ideally, your back-up plan should prevent the headaches
associated with having to locate files and figure out how to reconstruct damaged or lost data.
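As a rough illustration of the after-hours automation mentioned in the questions above, the following Python sketch archives one directory into a date-stamped, compressed tar file. The function name, the example paths, and the crontab line shown in the comment are assumptions made for this example; they are not part of any standard tool.

```python
import tarfile
import time
from pathlib import Path


def archive_directory(source_dir, dest_dir):
    """Write a gzip-compressed tar archive of source_dir into dest_dir.

    Returns the path of the archive that was created.
    """
    source = Path(source_dir)
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # Put the date in the file name so each night's archive is kept separately.
    stamp = time.strftime("%Y%m%d")
    archive_path = dest / f"{source.name}-{stamp}.tar.gz"

    with tarfile.open(archive_path, "w:gz") as archive:
        # arcname stores paths relative to the directory name, not the full path.
        archive.add(source, arcname=source.name)
    return archive_path


# Hypothetical crontab entry that would run a script built around this
# function at 2:00 a.m. every day (the script path is an assumption):
#   0 2 * * * /usr/bin/python3 /usr/local/sbin/nightly_backup.py
```

A script built this way would call, for example, archive_directory("/home", "/mnt/backup"), where both paths are placeholders for whatever your back-up plan specifies.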


Determining the Value of Data 


As with creating redundant systems, your back-up strategy should be based at least partially
on the value of the data that you are backing up. The more expensive data is to create,
acquire, or refine, the more you should spend to protect its integrity. Some data may only be
valuable to one person in an organization, but if that person's time is required to re-create any
data that is lost, the data still has value to the entire organization.


As an example, a study of the value of data held by an organization might determine that a
given set of files required 4,000 hours of work by the employees of the firm to create. A
different estimate might state that the data could be re-created given current experience and
facts in about 2,000 hours. If the average wage of the employees involved in the project is
approximately $40 per hour, the data would have a value of $80,000. But the study doesn't
end there.


The estimate of 2,000 hours (about one work-year) is based on an experienced employee
re-creating the data. If that well-trained employee spends time re-creating lost data, what
current work will he or she not be able to do? This is called the opportunity cost. The employee
might forgo a project worth many times $80,000 in order to re-create the lost data.
Opportunity cost extends even further. How was the data that was lost going to be used? Was
it part of a multimillion dollar advertising campaign? Or perhaps a financial merger? A great
deal of money may be lost because the data is unavailable when needed. Even if $80,000 can
be invested to re-create it, the moment of opportunity when the data was needed may be past.
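The arithmetic in this example can be checked with a few lines of Python. This is only a sketch: the function name and the optional opportunity_cost parameter are assumptions introduced for illustration, and the opportunity cost itself has to be estimated by the organization.

```python
def estimated_loss(hours_to_recreate, hourly_wage, opportunity_cost=0):
    """Labor cost of re-creating lost data, plus any estimated opportunity cost.

    opportunity_cost is a hypothetical, separately estimated dollar figure
    for the work or business that is forgone while the data is re-created.
    """
    return hours_to_recreate * hourly_wage + opportunity_cost


# Figures from the example above: 2,000 hours at about $40 per hour.
labor_only = estimated_loss(2000, 40)
print(f"Labor cost to re-create the data: ${labor_only:,}")
```

Passing a nonzero opportunity_cost shows how quickly the total can exceed the labor figure alone.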


This discussion doesn't address the anger or low morale of an employee who must
re-create a project that was partially or completely finished. These are also key factors in
any organization.


The following list summarizes questions to ask when determining the value of data:
+ How many hours of effort were spent creating the data?
+ How many hours of effort would be required to re-create the data?
+ How much inherent value does the data contain for the operation of the
organization?




+ Is the data irreplaceable?
+ Is the data time critical to a current project?


These considerations are similar to those raised in the discussion of hardware redundancy and
fault tolerance in Chapter 9. The decisions you make as a system administrator are also similar
to those you might make when evaluating your system's hardware: if data is worth millions of
dollars to your organization, don't hesitate to spend $50,000 to $100,000 to protect that data.
By answering the questions in the preceding list, you may be able to convince company
officers or supervisors that the expense is warranted. With the right hardware and software tools,
you'll be well prepared to secure the information that you safeguard as a system administrator.


Determining When to Back Up Data 


Once you have created an initial backup or archive of important data, the question of how
often to refresh the backup arises. Having at least one backup of data is better than having
none at all, but data changes frequently in most organizations. Continually backing up the
latest information stored on the system is a critical part of most system administrators' jobs.


The question of when to back up data is related to how valuable the data is to an organization.
You need to start by asking, “How often does the data change?” Another good question to ask
is this: “Do changes to the data affect the value of the data?”


The answers to these questions vary, depending on which part of your Linux system you are
evaluating. The operating system itself probably changes very little after your initial
configuration efforts. Applications installed on the system are also unlikely to change regularly. By
contrast, user data, log files, and other items change rapidly and are normally the focus of
back-up efforts. This data constitutes the daily work of others within your organization. If you
maintain regular backups, no one is likely to lose more than a few hours' worth of
work, even if the entire system crashes or a hard disk is destroyed.


Several back-up strategies are commonly used. You can select a strategy based on how often
data on your system changes and how valuable or critical each incremental piece of data is.
The following discussion describes a widely used back-up strategy for Linux.


A Linux Back-up Strategy 


Various strategies have evolved among Linux users to balance the need for a complete
backup of data at all times with the need for convenience in creating and maintaining
backups. The method described here is accepted as standard for most Linux and UNIX systems.
You can adjust the time frame according to how often the data on your system changes.




Using Back-up Levels 


To understand this back-up method, you need to understand the concept of a back-up level. 
A back-up level defines how much data is to be backed up in comparison with another 
back-up level. A back-up operation at a given back-up level copies all of the data that has 
changed since the last backup of the previous level. For example, a backup at level 1 stores all 
files that have changed since the last level 0 backup; a backup at level 2 stores all files that have 
changed since the last level 1 backup. A standard system might operate with three levels, as
described here: 


+ Level 0 is a full backup. Everything on the system is backed up. Suppose for this
example that a level 0 backup is performed on the first of every month.

+ A level 1 backup is done once per week. Every file that has been modified since
the last level 0 backup (on the first of the month) is included in the level 1 backup.
This is referred to as an incremental backup.

+ A level 2 backup is done each day. Every file that has been modified since the first
of the week (the last level 1 backup) is included in the level 2 backup. Like a level 1
backup, this is considered an incremental backup.


Figure 14-1 illustrates the three-level backup just described. 


[Figure: data on the hard disk and on back-up media at three points in time: beginning of
month, Level 0 (full); one week later, Level 1 (incremental); one week and one day later,
Level 2 (incremental)]

Figure 14-1 Back-up levels


The times associated with back-up levels are arbitrary, though a level 0 backup is normally a
full backup in which every file is backed up. Beyond that, you can assign any time frame that
you choose to each level; the point is simply that each level records all the changes since a
backup of a previous level.


The advantage to using back-up levels is that you can back up data frequently (meaning
very little work is lost if a system fails), but you don't have to back up the entire system each
time you do a backup.
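The selection rule behind back-up levels can be sketched in Python as follows. This is an illustration only: it compares file modification times against the time of the last lower-level backup, and the function names and the last_backup_times mapping are assumptions made for the example, not the way any particular back-up utility is implemented.

```python
import os


def files_changed_since(root, since_timestamp):
    """Return paths under root modified after since_timestamp (seconds since the epoch)."""
    changed = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.getmtime(path) > since_timestamp:
                changed.append(path)
    return sorted(changed)


def select_for_level(root, level, last_backup_times):
    """Pick the files that a backup at the given level would copy.

    last_backup_times maps a level number to the time of the most recent
    backup at that level. Level 0 copies everything; level N copies what
    has changed since the last level N-1 backup.
    """
    if level == 0:
        return files_changed_since(root, float("-inf"))
    return files_changed_since(root, last_backup_times[level - 1])
```

With monthly level 0, weekly level 1, and daily level 2 backups, a call such as select_for_level("/home", 2, times) would list only the files changed since the start of the week.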




Restoring a File from a Three-Level Backup 


Now consider how a system administrator would restore a file that a user had deleted and
needed help recovering. The user can't recall when the file was last modified, but it was
“recently.” The system administrator follows these steps to locate the file:


1. Check the most recent level 2 backup. If the file is there, it was changed in the last
day. This backup probably doesn't include very many files compared to the size of
the entire system, so it's easy to search for a file. If the file isn't there, then it wasn't
modified in the last 24 hours, so proceed to Step 2.

2. Check the most recent level 1 backup. If the file is there, it was changed sometime
after the first of the week, but not in the last 24 hours. This backup contains more
files, so it takes a little longer to search. If the file is not found, proceed to Step 3.

3. Check the most recent level 0 backup. The file will always be located on this
backup because a full backup includes every file on the system. But searching
through this backup may be time consuming because it is fairly large.


Back-up media such as tape drives and optical disks always have directories of their
contents to help you locate files as rapidly as possible, but a tape cartridge must be
rewound to the place where the file is stored. As a result, restoring a single file from a
tape cartridge can still be time consuming.


You may wonder why you shouldn't start searching for the file in the level 0 backup. You
should always start with the most recent backup in order to find the most recent version of a
file. If the file had been altered since the first of the month, the most recent copy of the file will
not be on the level 0 backup. Hence you should start with the most recent backup (level 2) to
see if the file is located there.
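The search order in the steps above can be sketched in a few lines of Python. The catalogs structure here is hypothetical: it stands in for whatever index of contents your back-up utility or media provide.

```python
def find_most_recent_copy(filename, catalogs):
    """Search back-up catalogs from the highest level to the lowest.

    catalogs maps a back-up level (0, 1, 2) to the set of file names stored
    in the most recent backup at that level. Returns the level holding the
    newest copy, or None if no backup contains the file.
    """
    for level in sorted(catalogs, reverse=True):
        if filename in catalogs[level]:
            return level
    return None


# Hypothetical catalogs for illustration only.
catalogs = {
    0: {"budget.txt", "notes.txt", "report.txt"},  # monthly full backup
    1: {"notes.txt", "report.txt"},                # changed since the level 0 backup
    2: {"report.txt"},                             # changed since the level 1 backup
}
print(find_most_recent_copy("notes.txt", catalogs))
```

Because the loop starts at level 2, a file that appears in several backups is always taken from the most recent one.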


Advantages to the three-level back-up method include: 


+ Creating the level 2 daily backups requires little of the system administrator's time
because few files are altered on any given day.

+ No user will ever lose more than a single day's work because the changes in the
file system from each day are recorded in a level 2 backup.

+ Files that rarely change are still backed up and available, but don't require daily
maintenance by the system administrator.


Some back-up utilities explicitly use the term back-up levels to refer to how data is backed up
and how back-up media are tracked. The concept can be applied to any utility, however. For
the system to work well, you need to keep careful records and label back-up media clearly.




In the event that an entire system must be restored using a set of back-up media that have been
prepared using the three-level method, a system administrator would follow this procedure:


1. Restore everything from the latest level 0 backup.
2. Restore everything from the latest level 1 backup.
3. Restore everything from the latest level 2 backup.


Figure 14-2 illustrates how this procedure will result in all of the latest information being
included in the restored file system. (Compare the back-up levels pictured in Figure 14-1 to
the restore operation pictured in Figure 14-2.)


[Figure: contents of the hard disk after a level 0 restore, then a level 1 restore, then a
level 2 restore]

Figure 14-2 Restoring data from a set of back-up media with levels 0, 1, and 2
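The full-restore procedure can be modeled as layering each backup over the previous one. The Python sketch below uses plain dictionaries to stand in for back-up media; the function name and the sample data are assumptions made for illustration.

```python
def restore_all(backups_by_level):
    """Rebuild a file system image by applying backups from level 0 upward.

    backups_by_level maps a level number to a dict of {path: contents}
    taken from the latest backup at that level.
    """
    restored = {}
    for level in sorted(backups_by_level):
        # Higher-level (more recent) backups overwrite older copies of a file.
        restored.update(backups_by_level[level])
    return restored


# Hypothetical contents of the latest backup at each level.
backups = {
    0: {"/home/ann/plan.txt": "v1", "/etc/hosts": "v1"},
    1: {"/home/ann/plan.txt": "v2"},
    2: {"/home/ann/plan.txt": "v3", "/home/ann/new.txt": "v1"},
}
print(restore_all(backups))
```

This simple model does not track files that were deleted between backups; a real restore would need to account for those separately.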


Managing and Storing Back-up Media


As you create a back-up plan that specifies back-up levels and times appropriate to your needs,
you must determine how many back-up media you will need (disks, tapes, cartridges) for each
level. That is, a level 0 full backup may require five tape cartridges, but a typical level 2 backup
requires only a single cartridge (because relatively few files are modified each day). As an
example, the three-level backup described previously might include the following:


+ Three months of level 0 backups, each requiring 5 tape cartridges, for a total of 15.

+ Five weeks of level 1 backups (some months have five weeks), each requiring 3 tape
cartridges, for a total of 15.

+ Five days of level 2 backups (you might need seven days if your organization runs
seven days per week), each requiring 1 tape cartridge, for a total of 5.




You would therefore need a total of 35 tape cartridges. Figure 14-3 illustrates this arrangement.
The importance of carefully labeling each tape cartridge cannot be overstated. If you can't
identify which back-up medium is the most recent of any given level, much of your back-up effort
will be useless when a serious problem arises.
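The cartridge count is simple arithmetic, and a short Python sketch makes it easy to rerun with your own numbers. The dictionary layout and function name are assumptions chosen for this example.

```python
def cartridges_needed(plan):
    """Total cartridges for a plan of {label: (sets_kept, cartridges_per_set)}."""
    return sum(sets_kept * per_set for sets_kept, per_set in plan.values())


# Figures from the three-level example above.
plan = {
    "level 0 (monthly)": (3, 5),  # three months, five cartridges each
    "level 1 (weekly)": (5, 3),   # five weeks, three cartridges each
    "level 2 (daily)": (5, 1),    # five days, one cartridge each
}
total = cartridges_needed(plan)
print(f"Cartridges needed: {total}")
```

Changing the level 2 entry to (7, 1) models an organization that runs seven days per week.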


[Figure: sets of tape cartridges for Month 1, arranged by level: Level 0, five tapes each;
Level 1, three tapes each; Level 2, one tape each]

Figure 14-3 Multiple tapes used for a three-level back-up plan


Most organizations would store one set of the monthly (level 0) back-up media and perhaps
the most recent weekly (level 1) back-up media off-site. The strategy for off-site storage
depends on how critical data is and how often someone wants to take the responsibility of
carrying the back-up media to the chosen secure location (such as a bank vault).


Most back-up media are designed to be used repeatedly, like a floppy disk. For example,
a rewriteable CD can be used about 1000 times, according to the manufacturer. But you
should nevertheless plan on a scheduled life for back-up media, so that you avoid
problems with deteriorating, outdated products.


Using the plan just outlined, you could reuse the same set of level 1 weekly tape cartridges
each month, starting with the oldest one. The same applies to the level 2 tape cartridges. For
example, on any Wednesday afternoon, you should have five level 2 tape cartridges containing
the following:


+ Last Wednesday's backup, which you will overwrite this evening with new data
+ Last Thursday's backup, which you will overwrite tomorrow evening with new data
+ Last Friday's backup
+ Monday's backup (from two days ago)
+ Tuesday's backup (from last night)
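The rotation just described amounts to keeping one level 2 cartridge per workday and overwriting the one written on the same weekday a week earlier. The Python sketch below illustrates that rule; the cartridge labels and the function name are assumptions for this example.

```python
import datetime

WORKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday"]


def tape_to_overwrite(date):
    """Return the label of the level 2 cartridge to reuse on the given date.

    With one cartridge per workday, the cartridge written on the same
    weekday last week is the oldest, so it is the one to overwrite.
    Returns None on weekends, when this plan runs no level 2 backup.
    """
    weekday = date.weekday()  # Monday is 0, Sunday is 6
    if weekday >= len(WORKDAYS):
        return None
    return f"level 2 - {WORKDAYS[weekday]}"


print(tape_to_overwrite(datetime.date.today()))
```

An organization that runs seven days per week would extend WORKDAYS to all seven day names.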




In addition to being fairly easy to manage, this system provides data redundancy. If you
have a problem and need to restore a file on this particular Wednesday, you first check the
Tuesday backup that you made last night. If a problem occurs with that tape cartridge, you
can also check Monday morning's level 1 backup, Monday evening's level 2 backup, or
even last Friday's backup. A user may lose more work if you cannot use the most recent
backup, but the user is unlikely to lose everything, because many copies of the file exist,
created at different times.


Backing Up the Root File System 


The root file system requires special attention in your back-up plan because it contains the tools
that you normally use to restore damaged data, such as a deleted file or data from a corrupted
hard disk partition. You must therefore think about how to respond if the root file system is
damaged, either by a hard disk failure or by corrupted configuration files that prevent you from
booting the Linux operating system kernel.


Chapter 9 described how to create a rescue floppy or a boot floppy. That disk, which you
can use to boot the system in an emergency, should contain the files necessary to restore the
contents of the root file system from your back-up device. These files might include:


+ The kernel modules needed to access the back-up device (such as SCSI modules)

+ Other kernel modules needed to access the device where the root file system
is stored

+ Configuration files needed to set up access to the back-up device

+ Information such as file indexes that are needed to locate the correct data and
restore it from back-up media


As you prepare a back-up plan, you'll want to consider the hardware and software that you'll
use to implement that plan. The next part of the chapter describes some key issues you face
in making hardware and software choices.


HARDWARE AND SOFTWARE ISSUES 


Once you have determined why, when, and how you want to back up your Linux system,
you must determine the best tools to use to get the job done. Linux includes all the necessary
software utilities for many back-up tasks. You can also purchase commercial back-up
software. Both of these options are described later in this chapter.


Many different hardware devices are available for backing up data. The next few sections
provide a review of the different options available to you.


Choosing Back-up Media


The size of hard disks in standard PCs is growing very rapidly. Whereas a 500 MB hard disk
was considered huge just a couple of years ago, hard disks with 50 GB (100 times that
amount) are now available for well under $1000. Storage space is often measured according




to its cost per megabyte. For example, if a 16 GB hard disk costs $400, the cost per megabyte
is about 2.5 cents. Similarly, if a tape cartridge used for backing up a system costs $79 and
holds 20 GB, the cost per megabyte is about 0.4 cents.
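The cost-per-megabyte comparison can be reproduced with a short Python sketch. It assumes 1 GB is treated as 1,000 MB; using 1,024 MB per gigabyte shifts the results slightly. The function name and the mb_per_gb parameter are introduced here for illustration.

```python
def cents_per_megabyte(price_dollars, capacity_gb, mb_per_gb=1000):
    """Cost per megabyte in cents.

    mb_per_gb=1000 is an assumption; pass 1024 to use binary units instead.
    """
    return price_dollars * 100 / (capacity_gb * mb_per_gb)


# Prices and capacities from the example above.
disk = cents_per_megabyte(400, 16)
tape = cents_per_megabyte(79, 20)
print(f"Hard disk: {disk:.2f} cents per MB")
print(f"Tape cartridge: {tape:.2f} cents per MB")
```

Run with these figures, the tape cartridge works out to roughly one-sixth the cost per megabyte of the hard disk.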


When you back up your data, you will normally have multiple copies of the data that was
backed up at different times. Back-up media such as tape cartridges generally cost much less
than a hard disk or other similar device, but you must purchase multiple tape cartridges to
back up the system.


Unfortunately, back-up devices have not kept pace with the growth in capacity of hard disks, 
though many different formats and devices are available for system backups, as described
here. The following paragraphs review the different back-up media (and corresponding 
devices) that you might consider for backing up your Linux system. 


Magnetic Media

Several types of magnetic media are occasionally used for specialized back-up needs.


+ Floppy disks: although you might be surprised to see this item listed, floppy disks are
a great way to back up small, sensitive pieces of information. For example, a boot
disk, a rescue disk, a firewall or other server configuration, and other similar data can
easily be copied to a floppy disk. The disk is inexpensive, easily transported, and
easily stored. Just be certain to label the disk and move the write-protection tab over
so you don't erase the floppy disk. An important disadvantage of floppy disks is that
they are fragile. You should maintain multiple floppy disk copies of any critical data
and check the integrity of the disks regularly.


+ RAID hard disks: most of the data that you want to back up is already on a hard
disk. It doesn't make sense to rely on long-term data storage located on another
hard disk (even a RAID array) if the same vulnerabilities apply to that device as
to your main hard disk. On the other hand, storing a back-up copy of crucial data
from several locations on a centralized RAID array is a useful way to maintain an
online backup, that is, a backup of the data that is still available if one of the hard
disks becomes unavailable. In general, however, don't plan your back-up strategy
around this sort of thing. Instead, look to removable devices such as tapes and
removable cartridges.


+ Removable media: many types of specialized cartridge storage devices are now
available. These include SyQuest cartridges, Floptical and similar devices that store a
large amount of data on a small disk similar in size to a floppy disk, and various
products such as the Zip and Jaz cartridges from Iomega. The data capacity of
these cartridges continues to rise. The latest Jaz cartridges hold 2 GB each.


Removable media, the last item in the preceding list, have several advantages, including the
following:


+ Random, immediate access to any point on the media, similar to a hard disk
+ The ability to expand storage by purchasing additional cartridges




+ Relatively easy access to the back-up device: most are treated like a standard hard
disk, formatted with the ext2 file system, and mounted normally.


Removable media also have disadvantages, such as:
+ High cost per megabyte of storage

+ Proprietary formats (compared to most tape backups), which may mean difficulty
obtaining new cartridges in the future and lack of support from other vendors


Optical Media 


Optical media used by devices such as writeable CD drives and DVD drives are an attractive
back-up choice. Advantages of optical media include:


+ Their large storage capacity is sufficient for many needs.
+ Storage media are very low cost.
+ Storage media are widely available.
+ Optical media are easily exchanged with vendors, customers, or other organizations.


Standard CDs, in particular, are a valuable method of exchanging large amounts of data with
suppliers and also of easily creating data archives. Because a single writeable CD costs less
than a dollar, it is cost effective to back up key data files regularly on a CD and to keep a set
of back-up CDs holding snapshots taken at various times. Rewriteable CDs, which you can
update in the same way that you update data on a hard disk, cost a little more but provide
more flexibility. The capacity of a CD is only about 640 MB. That's not much compared with
the data stored on an entire hard disk, but it's often sufficient for backing up an entire project
directory, graphics archive, programming project, or operating system.


DVD drives, which are increasingly popular for watching movies on a computer, also come
in a writeable format called DVD-RAM. A DVD-RAM cartridge (costing under $50) holds
about 5.2 GB of data. Drives are inexpensive as well. For data sets too large for writeable
CDs, the low cost and wide use of DVD make it an attractive choice.


Tape Cartridges 


Tape drives are the workhorses of most computer back-up efforts. Tape drives are fairly
inexpensive, as is the media (tape cartridges). Many formats are available, but in general, data
capacities have kept pace with that of hard disks. Thus you can purchase a tape drive that will
record 8, 40, or even 100 GB on a single tape cartridge. All such cartridges are priced under
$100, with the smaller capacities costing far less. If you need to back up large amounts of
data, such as hundreds or thousands of gigabytes, you should consider special tape cartridge
jukeboxes or high-end digital tape formats available from major device manufacturers such as
IBM and Hewlett-Packard. The term jukebox refers to a back-up device that holds multiple
back-up media (such as multiple tape cartridges or writeable CDs) and can switch between
them without assistance from a system administrator.




Tape drives are available in a variety of formats, and new formats seem to appear each year as
manufacturers rush to keep up with growing capacities and speedier computers. Manufacturers
of the latest tape drives claim storage capacities of up to 200 GB on a single tape cartridge;
others claim data transfer rates in excess of 200 MB per minute. Explaining the features of a
diversity of tape formats is beyond the scope of this book, but the information that follows provides
enough basic details to familiarize you with the formats you're likely to see.


Keep in mind that tape cartridges can accommodate different methods for storing data,
depending on the tape drive you use. This is similar to a regular 3.5-inch floppy disk, which
can be formatted with either an MS-DOS, Macintosh, or Linux file system.


When reviewing the great number of tape devices on the market, you may feel overwhelmed
by the alphabet soup of formats, companies, and product names. The following list describes
some major tape cartridge device types and data formats.


+ Digital Linear Tape (DLT) is a half-inch-wide tape inside a cartridge. The tapes store
up to about 40 GB and are considered highly reliable. Quantum is considered the
leader in DLT technology, but many others, such as StorageTek, also use DLT.

+ Linear Tape-Open (LTO) is an open tape standard used by Hewlett-Packard, IBM,
and Seagate (a prominent hard disk manufacturer). Many companies are currently
planning devices based on this high-capacity format.

+ Helical-scan tape drives write data onto a thin tape, either 4mm or 8mm wide. This
storage format is the same method used by videotapes for recording movies.
Figure 14-4 illustrates how a helical-scan device stores information by writing
short, angled strips of data on the tape. Helical-scan tapes (usually the 8mm size)
are used in several newer tape formats as described below.

+ Advanced Intelligent Tape (AIT) is a format developed by Sony. Each AIT cartridge
contains a memory chip that is used to increase the efficiency of data access. Sony
plans to release a revised version of AIT every two years, with a doubled storage
capacity and data transfer rate in each new version. AIT-3 tapes are expected to
hold 100 GB and transfer data at about 720 MB per minute. (AIT-3 devices are not
available at the time of writing.)

+ VXA is a technology developed by the Ecrix company. The VXA format attempts
to overcome some of the technical limitations that most other standard formats
face. For example, VXA avoids stopping and starting the tape drive while waiting
for the computer to send more data by using a variable speed tape drive and
organizing data into packets rather than a single stream, as most formats use.

+ Travan tape drives are widely used and are manufactured by many different
companies. They do not have high capacities (10 to 20 GB is standard), but they have
a longer history of reliability than many of the newer formats. Travan uses the QIC
tape cartridge format.
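Capacity and transfer rate together determine how long a backup or restore takes, which matters if your back-up plan sets a time limit for restoring data. The Python sketch below estimates the time to fill one cartridge using the AIT-3 figures quoted above. It assumes 1 GB is treated as 1,000 MB and a sustained transfer rate, which real drives may not achieve; the function name is introduced for illustration.

```python
def minutes_to_transfer(capacity_gb, rate_mb_per_minute, mb_per_gb=1000):
    """Estimated minutes to read or write capacity_gb at a sustained rate.

    mb_per_gb=1000 is an assumption; pass 1024 to use binary units instead.
    """
    return capacity_gb * mb_per_gb / rate_mb_per_minute


# Expected AIT-3 figures from the list above: 100 GB at about 720 MB per minute.
ait3_minutes = minutes_to_transfer(100, 720)
print(f"Filling one cartridge: about {ait3_minutes / 60:.1f} hours")
```

The same function can be used to compare formats once you know the capacity and rate a vendor claims.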




[Figure: a spinning read/write mechanism writing angled stripes of information onto a
data tape]

Figure 14-4 Using the helical-scan method to write data on a 4mm or 8mm tape


After reviewing the available formats, media, and devices in this section, you can use the
information in the next section to help you determine what factors to consider when
choosing a back-up device.


Comparing Devices 


Deciding among all of the available back-up devices and technologies is challenging.
System administrators who are creating a new system from scratch and need to store large
amounts of data may be able to focus on the latest technology for high-capacity, high-speed
tape drives. Other system administrators may be more concerned with sharing copies of
data between several existing computers using a CD drive, and so may opt for a low-cost
CD recorder. Still other administrators may be forced to purchase new devices that use
older technologies simply to keep costs low or maintain compatibility with existing systems,
even though this means much more work to maintain numerous back-up media. These are
just three examples of the varying requirements that have led manufacturers to develop so
many different devices and formats.


In most cases, the cost of the back-up device and the back-up media is an issue, at least
peripherally. Although you should keep in mind the discussion at the beginning of this
chapter regarding the value of an organization's data, managers who control budgets will still
expect you to be as frugal and wise as possible with an organization's money. The cost of the
various back-up device options is affected by several factors, including the following:


= What interface is used to connect the device to the computer? The interface is usually
SCSI (fast and expensive), IDE (common and inexpensive, but slower), or parallel
port (great for special applications and inexpensive, but quite slow compared to
IDE and SCSI).


= How recent is the format? The more recently developed formats are more expensive.
They generally hold more data, however.


= How much data can one disk or cartridge hold? The same media may be able to hold
slightly different amounts of data when used in different devices. The difference in
storage is unlikely to be more than 20%, however.




= Is the device from a name brand manufacturer? As with everything else in the computer
industry, buying a product from a company such as IBM or Hewlett-Packard
generally costs more than buying from a start-up or relatively unknown company. The
start-up company may support newer technologies, however, and may actually
have better products. New companies often must compete on price until their
quality or technology is recognized.


= Does the device have special features? The most common of these is an automounting
or jukebox feature, which allows you to load a number of disks or cartridges so that
the device can create a multivolume archive without user intervention. These
devices are always much more expensive than a device supporting a single manually
inserted disk or cartridge. They are also more subject to breakdown because of the
additional mechanisms required to automate media handling.


Among all the devices available, your decision may be guided by many things. The following
list presents a summary of factors that you should consider in selecting a device to fulfill your
back-up strategy. This list is not exhaustive, but it should give you a good start at considering
which device will be best for your needs, and at understanding why so many different
devices are available in the market.


= How much does the hardware device cost to acquire? A quick survey on a major hardware
supplier site such as www.warehouse.com will show you prices ranging from $150 for
a used 2 GB Iomega Jaz drive to more than $10,000 for a high-capacity,
name-brand jukebox tape cartridge system.


= How rapidly does data transfer from the computer system to the back-up media? This is less
important if you intend to back up using a scheduled cron job in the middle of
the night. It may suddenly become important again if you need to restore a large
amount of data with many people waiting while you do it. Data transfer rates are
usually measured in megabytes per minute (MB/min). For example, a 14 GB tape
drive might advertise a data transfer rate of 78 MB/min, meaning that an entire
14 GB tape could be filled in about three hours. Faster transfer rates generally cost
more, but the data transfer rate that you actually achieve is based on many factors,
such as the speed of your CPU, the quality of your system board components, and
the type of interface used to connect to the back-up device.


= Is data randomly accessible? In other words, is it easy to back up a single file or set of
files without restoring or going through an entire archive set?


= Can the device you choose perform very large backups using an autoloader or jukebox 
mechanism? If it can’t, you may always need to be present during system backups 
to switch media when one is full. 


= How much does media for the device cost? You can expect media costs to be greater
in the long term than the hardware cost if you use a device for several years.
Determine the media costs based on your planned back-up strategy, with
replacement media purchased regularly as recommended by the manufacturer
to avoid storage errors.




= Does the organization already own some back-up hardware? Is the cost savings for not
purchasing new hardware worth using the existing hardware if it relies on older or
obsolete technologies? When dealing with this issue, you will often hear the term
legacy systems. Legacy systems are systems that an organization already owns.
Working with existing systems is a common concern when organizations plan
new computer hardware or software acquisitions.


= How recent is the technology of the device? Some older devices are still very viable and
stable, but media for them may be difficult to locate. If you have a nine-track tape reel in
your office, you may have to work with a special service bureau to read the tape
because these devices are rarely used now. On the other hand, very new
technologies may not have proven themselves cost effective or technologically sound. For
example, some tape formats had problems when first released to the public because
magnetic material flaked off of the storage tapes.


= Does the device rely on an industry standard that many vendors support, or is it specific to
one vendor? If the device is only supplied by one vendor, can you rely on that
vendor to be around for a while, or could that vendor change formats or discontinue
a product, leaving you with outdated products or the prospect of retooling your
back-up plans?


= How long is the media life? This may not be a big issue if you are working with daily
backups, but most organizations maintain some sort of archival backup of
company financial records, personnel records, computer program source code, and
other electronic assets. The media that these assets are stored on should last long
enough so that you are not required to make an updated copy of massive amounts
of data every two years. Table 14-1 shows the anticipated life of some key
materials. (Paper and microfilm are included in the table for comparison, not because you
would use them as back-up media.) Note that the industry's experience with these
technologies does not extend past their supposed useful life. We know that paper
can last 500 years. No one really knows if CDs can last 30 years, because they
haven't existed that long.


Table 14-1 Comparison of Media Life


Media                                     Approximate useful life (before data loss potentially occurs)

Archive-quality acid-free paper           500 years
Microfilm                                 100 years
CD-ROM and similar optical media          5-30 years, depending on media quality
Hard disks and similar magnetic media     10-20 years, depending on media quality
Reel-to-reel data tapes                   15-25 years, depending on media quality
Tape cartridges (QIC, 4mm, 8mm, etc.)     5-10 years, depending on media quality
3.5 inch disks                            2-5 years



= How robust is the media? Can they be dropped? Can they handle the environmental
conditions that are part of your working area (heat, humidity, dust)? Most media
formats are quite robust, but if you work in a factory or outdoor environment, you
should consider these factors.


= Is the media easily transportable (if this is a requirement of your organization)? Most system
administrators keep the majority of back-up media near the systems that contain
the original data. This makes it convenient to restore data if a problem occurs. But
it’s also a good idea to take at least one copy to another location in case a fire or
other problem destroys the back-up media located nearby. Many organizations have
a strict policy about regularly taking a data backup to a bank vault or other secure
off-site location.


= Do you need to exchange data with other organizations or will you rely on service bureaus to
help you process or recover data from archive media? In this case you should consult
several service bureaus and select a media format and device that are widely available.


= How reliable is the hardware device? An unreliable or faulty back-up device can corrupt
back-up media so that no device can read them. Even if a hardware problem doesn't
corrupt media, a breakdown can interrupt your scheduled back-up times or delay
restoring data when a problem occurs.
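
The transfer-rate figure in the list above (a 14 GB tape filled at 78 MB/min in about three hours) is easy to check for any drive you are considering. The following is a minimal sketch of that arithmetic in the shell; the variable names are illustrative only, and it assumes 1 GB is counted as 1024 MB.

```shell
#!/bin/sh
# Estimate how long it takes to fill (or read back) one tape.
# The two figures below are the illustrative values used in the text.
capacity_gb=14
rate_mb_per_min=78

# Assumption: 1 GB = 1024 MB. Shell arithmetic is integer-only, so results round down.
capacity_mb=$(( capacity_gb * 1024 ))
minutes=$(( capacity_mb / rate_mb_per_min ))
hours=$(( minutes / 60 ))
leftover=$(( minutes % 60 ))

echo "About ${minutes} minutes (${hours} h ${leftover} min) to fill a ${capacity_gb} GB tape"
```

Running the same calculation with the capacity you actually need to restore, rather than the capacity of one tape, gives a rough lower bound on how long users would wait during a full restore. The rate a drive advertises is usually a best case.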


Once you have selected a back-up device and media format, you are almost ready to
implement your back-up plan. But a few additional issues still remain to be resolved. These are
discussed in the next section.


Verification, Permissions, and Other Issues


As the saying goes, “Trust everyone, but lock your doors.” The equivalent tactic when backing
up your system is to verify your backups on a regular basis. Verifying a backup is sometimes
done as part of a back-up utility, as described later in this chapter, but you can always perform
your own verification using steps such as these:


1. Pick a back-up tape or disk, either at random or according to a reasonable plan.
For example, you might decide to test a randomly chosen level 1 back-up tape
once per week.


2. Check the file listing on the tape by querying for the contents of the back-up
media. (This would be equivalent to using the ls command to see the contents of
the back-up media. With some media you can actually use the ls command; with
others you'll need to use a back-up utility.)


3. Restore a randomly selected file to the /tmp directory of your Linux system, just
to be certain that the data in the file can be retrieved and reassembled without
errors. If possible, do this step immediately after backing up data (on your regular
schedule), and then compare the file you restored with the original file that you
backed up to see that the size and contents match.
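
The verification steps above can be sketched with tar as follows. This is only an illustration under stated assumptions: an archive file in a scratch directory stands in for the tape device, and every path and filename here is hypothetical.

```shell
#!/bin/sh
# Sketch of the verification steps, using an archive file in a scratch
# directory as a stand-in for a tape (all paths are hypothetical).
set -e
work=$(mktemp -d)
mkdir -p "$work/data" "$work/restore"
echo "quarterly figures" > "$work/data/report.txt"
echo "meeting notes"     > "$work/data/notes.txt"

# Make the backup that will be verified.
tar cf "$work/backup.tar" -C "$work" data

# Steps 1 and 2: pick a backup and list its contents (t = table of contents).
tar tf "$work/backup.tar"

# Step 3: restore one file into a scratch area, then compare it with the original.
tar xf "$work/backup.tar" -C "$work/restore" data/report.txt
if cmp -s "$work/data/report.txt" "$work/restore/data/report.txt"; then
    verify_result="match"
else
    verify_result="MISMATCH"
fi
echo "report.txt: $verify_result"
```

The cmp command compares the two files byte for byte, which covers both the size check and the contents check in step 3. On a real system the archive argument would be the tape device rather than a file.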




When you back up data, exactly what information is backed up? Does the backup include the
contents of each file? What about the owner and file permissions associated with each file?
Many times a system administrator will have problems after restoring a large number of files
because the owner and group assigned to files and directories, or the file and directory
permissions, are not stored as part of the backup. The consequences of this can range from no one
being able to access his or her data once it is restored after a system shutdown, to everyone
being able to access everyone else's data on the system, including the system configuration
files. You'll have to decide which is worse in your organization.


Back-up utilities normally include options to maintain or ignore file ownership and
permissions. Normally you will want to maintain this information and check it carefully when you
verify your backups by restoring selected files.
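
As one way to check this with tar, the sketch below records a file with restrictive permissions, extracts it with the p option (which asks tar to restore the recorded permissions instead of applying the current umask), and compares the permission strings. The paths are hypothetical and an archive file stands in for the tape. Restoring file ownership generally also requires running the extraction as root.

```shell
#!/bin/sh
# Check that permissions survive a backup and restore (paths are hypothetical).
set -e
work=$(mktemp -d)
mkdir -p "$work/data" "$work/restore"
echo "payroll" > "$work/data/payroll.txt"
chmod 640 "$work/data/payroll.txt"

tar cf "$work/backup.tar" -C "$work" data

# The verbose listing shows the mode, owner, and group recorded in the archive.
tar tvf "$work/backup.tar"

# p = preserve the recorded permissions when extracting.
tar xpf "$work/backup.tar" -C "$work/restore"

# Compare the permission strings (the first 10 characters of ls -l output).
orig_mode=$(ls -l "$work/data/payroll.txt" | cut -c1-10)
restored_mode=$(ls -l "$work/restore/data/payroll.txt" | cut -c1-10)
echo "original: $orig_mode  restored: $restored_mode"
```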


Another issue related to how you choose to use back-up utilities involves the compression
feature that most utilities provide. Tape drives typically list a standard capacity and a
compressed capacity; back-up commands include options to compress data as it’s being archived.
Should you use these features? Probably so, but you should also be aware of their limitations.
By definition, when you compress data you remove the redundancy from it. That is,
compressed data can be re-created in its original form by adding back the redundant information
using an established set of rules.


To understand compression better, consider this example. When you see the words “hllo my
nm is Nchlas” you can probably understand their meaning even though part of the
information is missing. The missing information is redundant: it’s not needed for you to understand
the sentence. You can also use standard rules (English grammar and spelling) to reconstruct
the original sentence: “Hello my name is Nicholas.”


The danger with using compression is that with all the redundancy removed from a set of
information, all of the information and rules are needed in order to reconstruct the data. For
example, if you don’t speak English well, English words with missing letters are difficult to
decipher. In the same way, if even a small part of some compressed data is lost, the original
cannot be easily reconstructed. By leaving the redundancy in the data that you back up, you
might make it easier to fix any problems that occur on back-up media.


All modern back-up media formats are highly reliable, but when age, environmental factors
like heat and dust, and regular wear and tear are working against the data you have carefully
saved, you should consider whether compression is always necessary.
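
The trade-off described above can be seen in a small experiment. The sketch below is an illustration only: it uses gzip on a deliberately redundant sample file (the filenames are hypothetical), then simulates media damage by cutting the compressed file in half and running gzip's integrity test on the result.

```shell
#!/bin/sh
# Compress highly redundant data, then see how a damaged copy behaves.
work=$(mktemp -d)

# Sample data: the same line repeated 2000 times.
i=0
while [ "$i" -lt 2000 ]; do
    echo "hello my name is Nicholas"
    i=$(( i + 1 ))
done > "$work/sample.txt"

gzip -c "$work/sample.txt" > "$work/sample.txt.gz"
plain_size=$(wc -c < "$work/sample.txt")
packed_size=$(wc -c < "$work/sample.txt.gz")
echo "plain: $plain_size bytes, compressed: $packed_size bytes"

# Simulate media damage by keeping only the first half of the compressed file.
head -c $(( packed_size / 2 )) "$work/sample.txt.gz" > "$work/damaged.gz"

# gzip -t tests integrity without writing any output file.
if gzip -t "$work/damaged.gz" 2>/dev/null; then
    damaged_ok=yes
else
    damaged_ok=no
fi
echo "damaged archive passes integrity test: $damaged_ok"
```

The repeated text shrinks dramatically because it is almost entirely redundant, and the truncated copy fails the integrity test. An uncompressed text file damaged the same way would still have its first half readable.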


USING BACK-UP UTILITIES


Many utilities are available to back up data from a Linux system in a secure and organized
way. The most widely used of these utilities are the old UNIX standbys tar and cpio. Some
of the other utilities use these programs in the background while they present a graphical
interface to make configuration and selection of back-up options easier. Popular commercial
back-up utilities include features such as tracking tapes for you, keeping online indexes of
each backup that you have performed, and automating schedules for unattended backup
(similar to the options provided by the crontab command).




The following sections outline basic information about using these back-up utilities. 
Although a complete discussion of tar, cpio, and commercial tools is not presented here, 
you should understand enough to use these tools for basic backups and to locate more 
exhaustive information when needed. 


Using tar and cpio 


The name tar stands for tape archive; it is the oldest of the back-up tools for UNIX. The
cpio command (for copy in and out) is newer and includes additional features compared to
tar. cpio also reads tar-formatted files. Both tar and cpio can create archive files, such
as the .tgz format files that you may have seen when downloading Linux programs from
Internet sites. But tar and cpio can also create an archive directly on a tape cartridge or
other back-up device without first creating a file on your hard disk.


In order to use a tape drive or other back-up device, you must first install and configure
that device using the information presented in previous chapters. For example, see
Chapters 2, 3, and 4 regarding the installation of Linux and the use of kernel modules for
adding device support.


The tar and cpio commands operate differently. With the tar command you must specify
files to be included in a back-up archive on the command line. By contrast, cpio always looks
in the STDIN channel for the filenames to include in an archive. The tar command writes
data to a filename or device that you provide; the cpio command always writes data back to
STDOUT. To compare these two methods of operation, consider the following two examples
for creating a full backup of the /home directory. You can assume for this example that the
device /dev/tape is configured as a tape drive. (Notice that you refer directly to a tape drive
device; you do not mount it first.)


tar cf /dev/tape /home 


This command uses the c option of tar to create a new archive. The f option (for filename)
followed by the device name indicates the location where the archived data will be stored.
The last parameter, /home, indicates which files will be archived. Because the parameter is a
directory name, tar will include all files located within that directory. A cpio command
equivalent to the above tar command would be:


find /home -print | cpio -o > /dev/tape 


To use cpio, you must use the find command to generate a list of files (one filename per
line) for cpio to refer to. The find command with the -print option generates this list.
Those filenames are sent to cpio using a pipe symbol because cpio reads the filenames in
from STDIN. The > redirection operator then sends the archived files to the device
/dev/tape. The -o option on cpio indicates that the archive is being output (that is, that
data is being written out). A simpler example of cpio could archive the contents of a single
directory to a local file using the ls command to generate the list of files to archive:


ls | cpio -o > /tmp/archive.cpio




You might have noticed that tar options do not normally include a preceding hyphen;
those of the cpio command do.


The v option is normally added to both tar and cpio so that the output of the command
is verbose, meaning that the command prints details of what it is doing to the screen. With that
option added, the last example would look like this:


ls | cpio -ov > /tmp/archive.cpio


Extracting files using tar or cpio is a similar operation, but using different options. If you
had created an archive on a tape cartridge using tar, you could restore the contents of the
tape into the current directory using this command (with the x option standing for extract
and the v option included to see verbose messages about command progress):


tar xvf /dev/tape 


The cpio command uses the -i option for input, again extracting the contents of the back-up
media into the current directory. The -d option is also added here so that cpio will create
subdirectories that existed in the data as required to re-create the original data organization. When
using the cpio command with the -i option, cpio reads the STDIN channel to get the
archived data; so the < redirection operator is used with the filename or archive device name.


cpio -idv < /dev/tape 


These are very basic examples of tar and cpio. Each command supports dozens of options
for features such as compressing files, preserving file attributes, controlling a tape device,
setting timestamps on archived data, and many other things. You can review the manual and info
pages for each command to learn more.
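
To try the create and extract commands without a tape drive, you can point them at an archive file instead. The following sketch assumes a scratch directory with a hypothetical home/alice tree in place of /home and /dev/tape. Because cpio is not installed on every system, the cpio step is skipped when the command is missing.

```shell
#!/bin/sh
# Round trip with tar and cpio, using archive files instead of /dev/tape.
# All directory and file names here are hypothetical.
set -e
work=$(mktemp -d)
mkdir -p "$work/home/alice/docs" "$work/tar_out" "$work/cpio_out"
echo "draft" > "$work/home/alice/docs/letter.txt"

cd "$work"

# tar: the files to archive are named on the command line; f names the archive.
tar cf "$work/home.tar" home
tar xf "$work/home.tar" -C "$work/tar_out"

# cpio: filenames arrive on STDIN and the archive leaves on STDOUT.
if command -v cpio >/dev/null 2>&1; then
    find home -print | cpio -o > "$work/home.cpio"
    # -i extracts; -d creates subdirectories as needed.
    ( cd "$work/cpio_out" && cpio -id < "$work/home.cpio" )
fi

tar_copy="$work/tar_out/home/alice/docs/letter.txt"
cmp -s home/alice/docs/letter.txt "$tar_copy" && roundtrip=ok || roundtrip=failed
echo "tar round trip: $roundtrip"
```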


Both tar and cpio rely on other Linux commands to help you create an incremental or
multilevel backup. The most useful of these is the find command. For example, the following
find command will print a list of all files in the /home directory (and its subdirectories) that
have been modified in the last day (note the -mtime parameter):


find /home -mtime -1 -print


By using the list of files generated by this command as the archive list for cpio or tar, you can
easily create a level 2 backup, as described in the example earlier in the chapter, in which each
level 2 backup contains all files modified since the last level 1 backup. (In this case, this
command would be used on Tuesday; a different number of days would be used for each day of the
week so that data changed since the beginning of the week was included in the backup.) The
following two commands illustrate this:

find /home -mtime -1 -print | cpio -ov > /dev/tape

tar cf /dev/tape `find /home -mtime -1 -print`


The options available with the find command make it a powerful companion to the tar
and cpio commands. With find you can create a list of files owned by certain users, files
modified or accessed within certain time limits, files with certain file permissions, or many
other criteria.
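
The following sketch shows one way to combine find and tar for an incremental backup against an archive file. It makes several assumptions beyond the text: the directory names are hypothetical, touch -d and the tar -T option (read the list of names from a file) are GNU features, and -type f is added so that only regular files are listed, because tar archives a named directory together with everything in it.

```shell
#!/bin/sh
# Incremental backup sketch: archive only files changed within the last day.
# Paths are hypothetical; touch -d and tar -T are GNU extensions.
set -e
work=$(mktemp -d)
mkdir -p "$work/home/alice"
echo "old" > "$work/home/alice/old.txt"
echo "new" > "$work/home/alice/new.txt"

# Pretend old.txt was last changed three days ago.
touch -d "3 days ago" "$work/home/alice/old.txt"

cd "$work"

# -mtime -1 selects files modified less than one day ago;
# -type f keeps directories out of the list.
find home -type f -mtime -1 -print > "$work/changed.list"

# -T reads the names of the files to archive from the list.
tar cf "$work/incremental.tar" -T "$work/changed.list"

archived=$(tar tf "$work/incremental.tar")
echo "$archived"
```

Writing the list to a file first also leaves a record of what each incremental backup contained, and it avoids command-line length limits that backtick substitution can run into when many files have changed.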




Other Back-up Utilities 


The tar and cpio commands can operate either with a tape drive or with back-up devices
that rely on a standard ext2-style file system or standard mounting operation, such as a Jaz
drive or a writeable CD drive. As mentioned earlier, tape drives are popular tools for
backups, but they often require additional tools to manage tape indexes, tape rewinding and
searching, and so forth. If you intend to use a tape drive, a freely available graphical utility
worth reviewing is included with the KDE Desktop. The utility is called kdat, or the Tape
Back-up Tool. In Red Hat Linux when using Gnome, it is included on the Utilities submenu
under KDE menus. Most other Linux distributions will include this program on the
Utilities submenu of the KDE main menu.


The Tape Back-up Tool provides handy features like the following, all available from a
graphical interface and menu structure (see Figure 14-5):


Figure 14-5 The Tape Back-up Tool in KDE

= Back up and restore files by dragging and dropping them between a list of the tape
contents and a list of the hard disk contents
= Verify tape contents from the menu
= Manage mounting and unmounting of tape cartridges
= Create and maintain indexes of multiple tapes
= Set preferences from a graphical dialog box (see Figure 14-6)
= Format tapes


Figure 14-6 Setting preferences in the KDE Tape Back-up Tool
(the Options dialog box shows a default tape size of 2 GB, a tape block size of 10240 bytes,
the tape device /dev/tape, the tar command tar, and these tape drive options: Load tape on
mount, Lock tape drive on mount, Eject tape on unmount, Variable block size)


The Tape Back-up Tool is not intended to be compatible with all the high-end tape drives
that you might consider using for your Linux servers, but it provides an easy-to-use method
of tracking backups. It also makes it very simple to access data from a back-up tape.


Commercial Back-up Utilities 


The complexities of maintaining large numbers of back-up media for large volumes of data
led manufacturers long ago to create specialized software to help with the task. Fortunately,
some of these tools have made their way to the Linux platform, and others appear to be
forthcoming.


The best-known back-up utility with a strong following among Linux users is BRU, the
back-up and restore utility, from Enhanced Software Technologies (see www.bru.com). The main
screen of BRU is shown in Figure 14-7, and the scheduling tool is shown in Figure 14-8. This
product, which is included with some Linux distributions, provides features such as:


= Multiple levels of data verification
= Unattended operation with scheduled backups
= Assistance in labeling large numbers of tapes, including backups that require
multiple back-up media
= Support for numerous types of back-up devices


Figure 14-7 The main screen of BRU


Figure 14-8 The scheduling tool in BRU




Another popular Linux back-up tool is Arkeia, from Knox Software (see www.arkeia.com). The
Arkeia product is advertised as an enterprise network back-up solution and is considered a
more full-featured tool than BRU. It is designed to control backup of multiple remote systems
from a single location, saving or restoring data from anywhere on the network. Figure 14-9
shows a sample screen from the Arkeia program.


Figure 14-9 The Arkeia enterprise back-up program


Several additional back-up utilities are available for Linux or soon will be, some as part of
high-end hardware platforms from companies such as MTI (www.mti.com) and Legato
(www.legato.com).


CHAPTER SUMMARY 


= Creating a back-up plan to safeguard an organization’s data involves many
considerations, such as the value of data, which devices and media formats are best suited to
protecting that data, and how and when data should be backed up.


= Many types of back-up devices are available, with the most widely used being various
tape cartridge formats. Optical devices have many advantages but lack the storage
capacity of tape drives. Issues such as verifying data, compressing data, and restoring the
root file system all must be considered when preparing the back-up plan.




= The tar and cpio utilities can be used to create simple backups, including incremental
or multilevel backups. Numerous free and commercial graphical utilities are available to
make back-up maintenance more convenient for a system administrator.


KEY TERMS 

backup — A copy of data on a computer system. 

back-up level — A definition of how much data is to be backed up in comparison with 
another back-up level. When performing a back-up operation at a given back-up level,
all of the data that has changed since the last backup of the previous level is recorded.

back-up media — A device where data can be stored, such as a tape cartridge, writeable 
CD, or even a floppy disk.

back-up plan — A written document that outlines when, how, and, perhaps, why various 
files and file systems will be backed up, stored, and (when necessary) restored to prevent
permanent data loss.

cpio — A Linux archiving program. The cpio command also reads archive files created
by the tar command.

jukebox — A back-up device that holds multiple back-up media (such as multiple tape 
cartridges or writeable CDs) and that can switch between them without assistance from
a system administrator.

legacy systems — Computer systems that an organization already owns. This term usually
refers to systems that are no longer state of the art.

restore — To copy data from a back-up location (for example, a tape cartridge) onto the 
file system where that data is normally used, and from which it was unintentionally lost. 

tar — A Linux archiving program. 


REVIEW QUESTIONS 
1. A back-up plan would normally not include the following: 
a. A list of tape drive prices 
b. Times when backups are performed 
c. The location of critical files on the system 
d. A recommended time to replace old tape cartridges with new ones 
2. Explain how the speed with which files need to be restored affects a back-up plan. 


3. It is important always to back up the operating system files as often as user data files.
True or False?


4. Which of the following is part of measuring the value of data? 


a. The value of a project that cannot be done because data needed for the project was 
destroyed 


b. The cost of a complete set of back-up media to implement a three-level back-up plan 


c. The average data transfer rate of the chosen back-up device
d. The average storage capacity of similar back-up devices


5. Name two parts of a Linux system that are likely to change daily.


6. Explain why a level 1 backup is called an incremental backup.


7. Using back-up levels has the advantage of:
a. Reducing the time required to back up the entire file system
b. Making it easier to recover a file that has not been changed in several weeks
c. Allowing a system administrator to spend less time with backups but keep data
backed up very frequently
d. Causing all system backups to be available via a single file index


8. Using a standard three-level back-up plan with the time intervals described in the
chapter text, a user would expect never to lose more than ________ worth
of work.
a. A week's
b. A day's
c. An hour's
d. 20MB


9. Explain in detail why a system administrator must use back-up media from three
backups in order to completely restore a system that used three back-up levels.


10. Floppy disks are a useful back-up media in cases where:
a. The cost of writeable CDs is prohibitive.
b. Small amounts of critical data need to be backed up.
c. Extreme durability is a key factor in the choice of media.
d. A high data transfer rate is critical.


11. As a rule, tape cartridges can hold much more than optical media. True or False?


12. Describe two advantages of a back-up device with a jukebox feature.


13. Using a SCSI interface to connect a back-up device has the advantage of:
a. Low cost
b. Being proprietary (controlled by one company)
c. High data transfer rates
d. Limited availability


14. Name five factors to consider when selecting a back-up device and media type.
Explain the circumstances in which each would be a controlling factor in the decision.


15. Name three tape cartridge formats and comment briefly on each.


16. You can expect a CD or other optical media to last about as long as high-quality
microfilm. True or False?

17. The purpose of verifying your backups is to:
a. Be certain that files are correctly recorded and can be restored
b. Ascertain whether anyone has tampered with data contained in a backup
c. Secure data from unauthorized use
d. Compare data transfer rates among competing products


18. Explain how redundancy applies to compressed data.


19. The tar utility differs from the cpio utility in that:
a. cpio always reads and writes to STDIN and STDOUT, while tar uses
command-line parameters.
b. cpio is a commercial utility, while tar is free software.
c. cpio is widely used for Internet archive files, while tar is not.
d. cpio is an older format that is not compatible with newer tar archives.


20. The ________ utility is a commercial back-up utility from Knox Software.
a. BRU
b. kdat
c. Arkeia
d. mke2fs


21. Describe why the find command is often used with tar or cpio for incremental
backups.


22. The ________ option causes the tar command to extract files from an
archive file or device.


23. Describe the special considerations that must be taken in order to restore the root file
system of Linux after a hardware failure.


24. Name three removable media formats besides CD and tape cartridges.


25. In the long term, back-up media are likely to cost more than the back-up device used
to access them. True or False?




HANDS-ON PROJECTS 
Project 14-1 


In this activity you learn more about the Arkeia and BRU commercial back-up utilities. To
complete this activity you should have a Web browser with access to the Internet.


1. Start your Web browser and go to www.arkeia.com.


2. Review the Supported Platforms page. What comments would you make about this
company’s support of Linux? What advantages do you foresee if you choose to use this
software in a large organization that uses many types of computers?


3. Review the Product Features pages. (There are many pages with different categories of
features.) Locate three features that you understand, based on what you have learned in
this chapter. (You will not understand all of the features after reading this chapter.)


4. Change to the section of the Web site containing white papers (technical reports). 
Select one of the reports and read it online. 


5. If you are interested in experimenting with this software, download a copy. 
6. Go to the BRU Web site at www.bru.com.


7. Explore the Products and Support pages. What comments do you have about the
differences between the two products?


8. If you are interested in experimenting with this software, download a copy using the 
Download link. 


Project 14-2


In this project you explore the Web site of a major computer sales company to learn more
about what back-up devices are available. To complete this activity you should have a Web
browser with access to the Internet.


1. Start your Web browser and go to www.warehouse.com. 
2. Verify that the PC Products page is displayed, and then choose the Drives/Storage link.


3. Under the Removable Storage heading on the left side of the browser window, choose
the Tape link.


4. Review the “best-selling” devices pictured. Do you recognize the formats from
those mentioned in the chapter? Do you recognize some of the brand names of the 
manufacturers? 


5. Under PC Tape Drives on the left side of the browser window, choose Advanced 
Intelligent Tape (AIT). 

6. Review the brands, capacities, and prices presented in this category of products.

7. Use the browser's Back button to return to the previous page.

8. Under PC Tape Drives on the left side of the browser window, choose Travan.


9. Review the brands, capacities, and prices presented in this category of products. How
do they compare with the AIT devices?




10. In the banner at the top of the screen, click on the Supplies link. 


11. Under the Supplies heading on the left side of the browser window, choose 
Magnetic Media. 

12. Under the Magnetic Media heading on the left side of the browser window, choose
QIC 1/4" Tape. (This is the tape used by Travan-format drives.) Note the prices of
the tapes.


13. Use the Back button on your browser and explore the prices of other tape formats. 


Project 14-3 


In this activity you use the tar command to create a simple data archive file and then extract
the contents of that file into another directory. To complete this activity you should have a
working Linux system with root access.


1. Log in to Linux as root.

2. If you logged in using a graphical login window, open a command-line window. 
3. Enter cd /etc to change to the /etc directory. 
4. Enter ls -l | less and review the filenames and the file permissions that you have for
the various configuration files in this directory.


5. Create a tar-format archive of the configuration files in the /etc directory using
the command tar cf /tmp/testing.tar /etc. Because you are including the 
pathname to both the testing.tar archive file and the directory containing the 
information you want to archive, you could execute this command from any location 
on the system. 


6. Enter tar cvf /tmp/testing2.tar /etc, which is a similar command, this time
including the v option. After you execute this command, you see a list of all the files in 
the /etc directory appear on the screen as each is added to the archive file. 


7. Change to your home directory by entering the cd command.


8. Use the ls command to examine the contents of your home directory. Make certain
you do not have a file called lilo.conf in your home directory. (You shouldn't, but if
you do from a previous exercise, rename it to something else to complete this project.)


9. Enter tar xvf /tmp/testing.tar etc/lilo.conf to use the x option of the
tar command to extract a single file from the tar archive that you just created. The file
is placed in your current directory. Notice that because of the v option the filename is
printed to the screen as it is extracted.


10. Use the ls command to review the contents of your home directory. Do you see a file
called lilo.conf? Look for an item named etc.


11. Enter cd etc (without a forward slash) to change to the etc subdirectory of your
home directory. The tar command created the subdirectory in which the requested
file was located, starting with your current directory when you issued the command to 
extract the file from the archive. 


12. Use the ls command again to see the lilo.conf file in the etc subdirectory of
your home directory. 
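
The steps above can also be tried without root access. The following is a minimal sketch, not part of the project itself: it builds a small stand-in for /etc in a scratch directory, archives it, and extracts a single file. The scratch directory (created with mktemp -d, assumed to be available), the src and restore directory names, and the sample file contents are all hypothetical. The archive is created from inside the source directory so that member names are relative (etc/lilo.conf), which matches the names GNU tar stores after it strips the leading / from /etc.

```shell
#!/bin/sh
# Sketch only: rehearses the create/list/extract steps of Project 14-3
# in a scratch area. All paths under $work are hypothetical.
set -eu

work=$(mktemp -d)                      # scratch area (assumes mktemp -d exists)
mkdir -p "$work/src/etc" "$work/restore"

# Stand-in configuration files with made-up contents
echo "boot=/dev/hda" > "$work/src/etc/lilo.conf"
echo "127.0.0.1 localhost" > "$work/src/etc/hosts"

# c = create, f = write to the named archive file
(cd "$work/src" && tar cf "$work/testing.tar" etc)

# t = list the member names stored in the archive
tar tf "$work/testing.tar"

# x = extract, v = print each name as it is processed.
# Only the named member is restored, under the current directory.
cd "$work/restore"
tar xvf "$work/testing.tar" etc/lilo.conf

# The etc subdirectory was re-created relative to the current directory
ls -R "$work/restore"
```

When you are finished, the scratch area can be removed with rm -rf "$work".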




CASE PROJECTS 


1. You are working for General Linux Corporation, a relatively new company focused on
Linux products and services. The company has about 100 employees and has just
become a publicly traded corporation. As news of the public stock offering becomes
more widely known, the workload on all employees is increasing, but everyone is
pleased with the opportunities provided by this step. The company’s Web site is also
extra busy as potential customers and investors review information about the company
and its products. As a system administrator, do you think the three-level back-up strategy
outlined in the chapter text is sufficient to protect the data stored on company servers?
What changes would you make to that plan?

2. Given that the company considers itself an “Internet company,” with many large Linux
servers and all employees working with Internet resources each day, which of the
back-up utilities you have reviewed would you consider using? Why?


3. What factors influence your decision about which devices and media to use for your
back-up plan at General Linux? What factors do you consider in valuing the data that
you are protecting via your back-up plan? 


